Machine Learning Models and Data-Balancing Techniques for Credit Scoring: What Is the Best Combination?
نویسندگان
چکیده
Forecasting the creditworthiness of customers is a central issue banking activity. This task requires analysis large datasets with many variables, for which machine learning algorithms and feature selection techniques are crucial tool. Moreover, percentages “good” “bad” typically imbalanced such that over- undersampling should be employed. In literature, most investigations tackle these three issues individually. Since there little evidence about their joint performance, in this paper, we try to fill gap. We use five classifiers, each them combined different various data-balancing approaches. According empirical retail credit bank dataset, find best combination given by random forests, forest recursive elimination oversampling.
منابع مشابه
Machine Learning Models for Housing Prices Forecasting using Registration Data
This article has been compiled to identify the best model of housing price forecasting using machine learning methods with maximum accuracy and minimum error. Five important machine learning algorithms are used to predict housing prices, including Nearest Neighbor Regression Algorithm (KNNR), Support Vector Regression Algorithm (SVR), Random Forest Regression Algorithm (RFR), Extreme Gradient B...
متن کاملWhat Is the Best Research Globally?
Dr. Afshari’s editorial gives insight on human nature (1). Collaborative willingness usually occurs when different sides perceive personal benefit, or when institutions mandate collaboration. This is not unique between higher and lower income countries; many working in global emergency medicine (EM) observe this within wealthier countries. It exists among collaborators in low resource settings....
متن کاملCredit scoring in banks and financial institutions via data mining techniques: A literature review
This paper presents a comprehensive review of the works done, during the 2000–2012, in the application of data mining techniques in Credit scoring. Yet there isn’t any literature in the field of data mining applications in credit scoring. Using a novel research approach, this paper investigates academic and systematic literature review and includes all of the journals in the Science direct onli...
متن کاملConsumer credit scoring models with limited data
In this paper we design the neural network consumer credit scoring models for financial institutions where data usually used in previous research are not available. We use extensive primarily accounting data set on transactions and account balances of clients available in each financial institution. As many of these numerous variables are correlated and have very questionable information conten...
متن کاملWhat is the Clinical Skills Learning Center?
With shorter periods of hospitalazation, fewer patient beds and more health care facilities in the society, patients are now more acutely ill and highly dependent, causing less opportunities for medical students to practice and learn basic clinical skills. On the other hand, enhanced patient rights and other learnig limitations require that professional education provide not only knowledge and ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Risks
سال: 2022
ISSN: ['2227-9091']
DOI: https://doi.org/10.3390/risks10090169